Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 7998 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 874.9 KiB |
| Average record size in memory | 112.0 B |
Variable types
| Categorical | 1 |
|---|---|
| Numeric | 13 |
Variable descriptions
| date | Erfassungszeitpunkt |
|---|---|
| co_gt | stündlich gemittelte CO-Konzentration |
| pt08_s1_co | stündlich gemittelte Sensorreaktion (nominell auf CO ausgerichtet) (Zinnoxid) |
date has a high cardinality: 7998 distinct values | High cardinality |
co_gt is highly correlated with pt08_s1_co and 6 other fields | High correlation |
pt08_s1_co is highly correlated with co_gt and 5 other fields | High correlation |
c6h6_gt is highly correlated with co_gt and 6 other fields | High correlation |
pt08_s2_nmhc is highly correlated with co_gt and 6 other fields | High correlation |
nox_gt is highly correlated with co_gt and 3 other fields | High correlation |
pt08_s3_nox is highly correlated with co_gt and 5 other fields | High correlation |
no2_gt is highly correlated with co_gt and 4 other fields | High correlation |
pt08_s4_no2 is highly correlated with pt08_s1_co and 5 other fields | High correlation |
pt08_s5_o3 is highly correlated with co_gt and 7 other fields | High correlation |
t is highly correlated with pt08_s4_no2 and 1 other fields | High correlation |
ah is highly correlated with pt08_s4_no2 and 1 other fields | High correlation |
co_gt is highly correlated with nox_gt and 1 other fields | High correlation |
pt08_s1_co is highly correlated with c6h6_gt and 6 other fields | High correlation |
c6h6_gt is highly correlated with pt08_s1_co and 6 other fields | High correlation |
pt08_s2_nmhc is highly correlated with pt08_s1_co and 6 other fields | High correlation |
nox_gt is highly correlated with co_gt and 1 other fields | High correlation |
pt08_s3_nox is highly correlated with t and 2 other fields | High correlation |
no2_gt is highly correlated with co_gt and 1 other fields | High correlation |
pt08_s4_no2 is highly correlated with pt08_s1_co and 6 other fields | High correlation |
pt08_s5_o3 is highly correlated with pt08_s1_co and 5 other fields | High correlation |
t is highly correlated with pt08_s1_co and 6 other fields | High correlation |
rh is highly correlated with pt08_s1_co and 7 other fields | High correlation |
ah is highly correlated with pt08_s1_co and 7 other fields | High correlation |
co_gt is highly correlated with nox_gt and 1 other fields | High correlation |
pt08_s1_co is highly correlated with c6h6_gt and 4 other fields | High correlation |
c6h6_gt is highly correlated with pt08_s1_co and 4 other fields | High correlation |
pt08_s2_nmhc is highly correlated with pt08_s1_co and 4 other fields | High correlation |
nox_gt is highly correlated with co_gt and 1 other fields | High correlation |
pt08_s3_nox is highly correlated with pt08_s1_co and 3 other fields | High correlation |
no2_gt is highly correlated with co_gt and 1 other fields | High correlation |
pt08_s4_no2 is highly correlated with pt08_s1_co and 2 other fields | High correlation |
pt08_s5_o3 is highly correlated with pt08_s1_co and 3 other fields | High correlation |
t is highly correlated with ah | High correlation |
ah is highly correlated with t | High correlation |
pt08_s1_co is highly correlated with nmhc_gt and 7 other fields | High correlation |
nmhc_gt is highly correlated with pt08_s1_co and 7 other fields | High correlation |
c6h6_gt is highly correlated with pt08_s1_co and 7 other fields | High correlation |
pt08_s2_nmhc is highly correlated with pt08_s1_co and 7 other fields | High correlation |
nox_gt is highly correlated with pt08_s1_co and 6 other fields | High correlation |
pt08_s3_nox is highly correlated with pt08_s1_co and 6 other fields | High correlation |
no2_gt is highly correlated with pt08_s1_co and 5 other fields | High correlation |
pt08_s4_no2 is highly correlated with pt08_s1_co and 6 other fields | High correlation |
pt08_s5_o3 is highly correlated with pt08_s1_co and 7 other fields | High correlation |
t is highly correlated with pt08_s4_no2 | High correlation |
date is uniformly distributed | Uniform |
date has unique values | Unique |
Reproduction
| Analysis started | 2022-04-29 12:01:30.578322 |
|---|---|
| Analysis finished | 2022-04-29 12:01:48.212115 |
| Duration | 17.63 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 7998 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 62.6 KiB |
| 2004-03-10 18:00:00 | 1 |
|---|---|
| 2004-10-18 17:00:00 | 1 |
| 2004-10-19 06:00:00 | 1 |
| 2004-10-19 05:00:00 | 1 |
| 2004-10-19 04:00:00 | 1 |
| Other values (7993) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 7998 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2004-03-10 18:00:00 |
|---|---|
| 2nd row | 2004-03-10 19:00:00 |
| 3rd row | 2004-03-10 20:00:00 |
| 4th row | 2004-03-10 21:00:00 |
| 5th row | 2004-03-10 22:00:00 |
Common Values
| Value | Count | Frequency (%) |
| 2004-03-10 18:00:00 | 1 | < 0.1% |
| 2004-10-18 17:00:00 | 1 | < 0.1% |
| 2004-10-19 06:00:00 | 1 | < 0.1% |
| 2004-10-19 05:00:00 | 1 | < 0.1% |
| 2004-10-19 04:00:00 | 1 | < 0.1% |
| 2004-10-19 03:00:00 | 1 | < 0.1% |
| 2004-10-19 02:00:00 | 1 | < 0.1% |
| 2004-10-19 01:00:00 | 1 | < 0.1% |
| 2004-10-19 00:00:00 | 1 | < 0.1% |
| 2004-10-18 23:00:00 | 1 | < 0.1% |
| Other values (7988) | 7988 |
Length
| Value | Count | Frequency (%) |
| 19:00:00 | 334 | 2.1% |
| 20:00:00 | 334 | 2.1% |
| 18:00:00 | 334 | 2.1% |
| 23:00:00 | 334 | 2.1% |
| 22:00:00 | 334 | 2.1% |
| 21:00:00 | 334 | 2.1% |
| 07:00:00 | 333 | 2.1% |
| 08:00:00 | 333 | 2.1% |
| 09:00:00 | 333 | 2.1% |
| 10:00:00 | 333 | 2.1% |
| Other values (348) | 12660 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
co_gt
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONstündlich gemittelte CO-Konzentration
| Distinct | 96 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -39.62661915 |
| Minimum | -200 |
|---|---|
| Maximum | 11.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1654 |
| Negative (%) | 20.7% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | -200 |
| Q1 | 0.5 |
| median | 1.5 |
| Q3 | 2.6 |
| 95-th percentile | 4.7 |
| Maximum | 11.9 |
| Range | 211.9 |
| Interquartile range (IQR) | 2.1 |
Descriptive statistics
| Standard deviation | 81.90333138 |
|---|---|
| Coefficient of variation (CV) | -2.066876588 |
| Kurtosis | 0.09589990093 |
| Mean | -39.62661915 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -1.447160535 |
| Sum | -316933.7 |
| Variance | 6708.155691 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 1654 | 20.7% |
| 1 | 240 | 3.0% |
| 1.6 | 227 | 2.8% |
| 1.7 | 221 | 2.8% |
| 1.4 | 215 | 2.7% |
| 1.5 | 215 | 2.7% |
| 0.7 | 214 | 2.7% |
| 1.3 | 212 | 2.7% |
| 1.2 | 209 | 2.6% |
| 1.1 | 203 | 2.5% |
| Other values (86) | 4388 |
| Value | Count | Frequency (%) |
| -200 | 1654 | |
| 0.1 | 25 | 0.3% |
| 0.2 | 37 | 0.5% |
| 0.3 | 87 | 1.1% |
| 0.4 | 134 | 1.7% |
| 0.5 | 181 | 2.3% |
| 0.6 | 197 | 2.5% |
| 0.7 | 214 | 2.7% |
| 0.8 | 197 | 2.5% |
| 0.9 | 192 | 2.4% |
| Value | Count | Frequency (%) |
| 11.9 | 1 | |
| 11.5 | 1 | |
| 10.2 | 2 | |
| 10.1 | 1 | |
| 9.9 | 1 | |
| 9.5 | 1 | |
| 9.4 | 1 | |
| 9.3 | 1 | |
| 9.2 | 1 | |
| 9.1 | 2 |
pt08_s1_co
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONstündlich gemittelte Sensorreaktion (nominell auf CO ausgerichtet) (Zinnoxid)
| Distinct | 1028 |
|---|---|
| Distinct (%) | 12.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1051.390473 |
| Minimum | -200 |
|---|---|
| Maximum | 2040 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 289 |
| Negative (%) | 3.6% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | 751.85 |
| Q1 | 917 |
| median | 1051.5 |
| Q3 | 1225 |
| 95-th percentile | 1506 |
| Maximum | 2040 |
| Range | 2240 |
| Interquartile range (IQR) | 308 |
Descriptive statistics
| Standard deviation | 324.5589983 |
|---|---|
| Coefficient of variation (CV) | 0.3086950156 |
| Kurtosis | 5.854740873 |
| Mean | 1051.390473 |
| Median Absolute Deviation (MAD) | 149.5 |
| Skewness | -1.650185639 |
| Sum | 8409021 |
| Variance | 105338.5434 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 289 | 3.6% |
| 969 | 25 | 0.3% |
| 973 | 24 | 0.3% |
| 1100 | 24 | 0.3% |
| 925 | 24 | 0.3% |
| 892 | 22 | 0.3% |
| 966 | 22 | 0.3% |
| 962 | 22 | 0.3% |
| 1050 | 22 | 0.3% |
| 1053 | 21 | 0.3% |
| Other values (1018) | 7503 |
| Value | Count | Frequency (%) |
| -200 | 289 | |
| 647 | 1 | < 0.1% |
| 649 | 1 | < 0.1% |
| 655 | 1 | < 0.1% |
| 667 | 3 | < 0.1% |
| 669 | 1 | < 0.1% |
| 676 | 1 | < 0.1% |
| 678 | 1 | < 0.1% |
| 679 | 1 | < 0.1% |
| 681 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2040 | 1 | |
| 2008 | 1 | |
| 1982 | 1 | |
| 1975 | 1 | |
| 1973 | 1 | |
| 1961 | 1 | |
| 1956 | 1 | |
| 1934 | 1 | |
| 1918 | 1 | |
| 1917 | 1 |
| Distinct | 430 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -152.1387847 |
| Minimum | -200 |
|---|---|
| Maximum | 1189 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 7084 |
| Negative (%) | 88.6% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | -200 |
| Q1 | -200 |
| median | -200 |
| Q3 | -200 |
| 95-th percentile | 173 |
| Maximum | 1189 |
| Range | 1389 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 150.0967084 |
|---|---|
| Coefficient of variation (CV) | -0.9865775431 |
| Kurtosis | 15.54328729 |
| Mean | -152.1387847 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.716461271 |
| Sum | -1216806 |
| Variance | 22529.02188 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 7084 | |
| 66 | 14 | 0.2% |
| 40 | 9 | 0.1% |
| 29 | 9 | 0.1% |
| 88 | 8 | 0.1% |
| 93 | 8 | 0.1% |
| 84 | 7 | 0.1% |
| 55 | 7 | 0.1% |
| 95 | 7 | 0.1% |
| 60 | 7 | 0.1% |
| Other values (420) | 838 | 10.5% |
| Value | Count | Frequency (%) |
| -200 | 7084 | |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 14 | 2 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 4 | 0.1% |
| 18 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 1189 | 1 | |
| 1129 | 1 | |
| 1084 | 1 | |
| 1042 | 1 | |
| 974 | 1 | |
| 926 | 1 | |
| 899 | 1 | |
| 880 | 1 | |
| 872 | 1 | |
| 840 | 1 |
| Distinct | 406 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.853500875 |
| Minimum | -200 |
|---|---|
| Maximum | 63.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 289 |
| Negative (%) | 3.6% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | 0.9 |
| Q1 | 4.3 |
| median | 8.3 |
| Q3 | 14.2 |
| 95-th percentile | 24.9 |
| Maximum | 63.7 |
| Range | 263.7 |
| Interquartile range (IQR) | 9.9 |
Descriptive statistics
| Standard deviation | 39.97771659 |
|---|---|
| Coefficient of variation (CV) | 14.01005934 |
| Kurtosis | 20.99589177 |
| Mean | 2.853500875 |
| Median Absolute Deviation (MAD) | 4.6 |
| Skewness | -4.687309803 |
| Sum | 22822.3 |
| Variance | 1598.217824 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 289 | 3.6% |
| 3 | 68 | 0.9% |
| 3.6 | 68 | 0.9% |
| 2.8 | 67 | 0.8% |
| 4 | 67 | 0.8% |
| 3.8 | 65 | 0.8% |
| 2.6 | 63 | 0.8% |
| 5.4 | 62 | 0.8% |
| 6 | 62 | 0.8% |
| 3.1 | 61 | 0.8% |
| Other values (396) | 7126 |
| Value | Count | Frequency (%) |
| -200 | 289 | |
| 0.1 | 2 | < 0.1% |
| 0.2 | 5 | 0.1% |
| 0.3 | 7 | 0.1% |
| 0.4 | 13 | 0.2% |
| 0.5 | 15 | 0.2% |
| 0.6 | 17 | 0.2% |
| 0.7 | 25 | 0.3% |
| 0.8 | 18 | 0.2% |
| 0.9 | 19 | 0.2% |
| Value | Count | Frequency (%) |
| 63.7 | 1 | |
| 52.1 | 1 | |
| 50.8 | 1 | |
| 50.7 | 1 | |
| 50.6 | 1 | |
| 49.5 | 1 | |
| 49.4 | 1 | |
| 48.2 | 1 | |
| 47.7 | 1 | |
| 47.5 | 1 |
| Distinct | 1222 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 911.895974 |
| Minimum | -200 |
|---|---|
| Maximum | 2214 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 289 |
| Negative (%) | 3.6% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | 489.85 |
| Q1 | 727 |
| median | 911 |
| Q3 | 1122 |
| 95-th percentile | 1427.15 |
| Maximum | 2214 |
| Range | 2414 |
| Interquartile range (IQR) | 395 |
Descriptive statistics
| Standard deviation | 340.000627 |
|---|---|
| Coefficient of variation (CV) | 0.3728502337 |
| Kurtosis | 2.453179288 |
| Mean | 911.895974 |
| Median Absolute Deviation (MAD) | 197 |
| Skewness | -0.7866334169 |
| Sum | 7293344 |
| Variance | 115600.4263 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 289 | 3.6% |
| 853 | 22 | 0.3% |
| 776 | 20 | 0.3% |
| 859 | 20 | 0.3% |
| 814 | 19 | 0.2% |
| 880 | 19 | 0.2% |
| 769 | 19 | 0.2% |
| 850 | 18 | 0.2% |
| 962 | 18 | 0.2% |
| 900 | 18 | 0.2% |
| Other values (1212) | 7536 |
| Value | Count | Frequency (%) |
| -200 | 289 | |
| 383 | 2 | < 0.1% |
| 388 | 1 | < 0.1% |
| 390 | 1 | < 0.1% |
| 397 | 1 | < 0.1% |
| 399 | 1 | < 0.1% |
| 402 | 1 | < 0.1% |
| 407 | 1 | < 0.1% |
| 410 | 1 | < 0.1% |
| 412 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 2214 | 1 | |
| 2007 | 1 | |
| 1983 | 1 | |
| 1981 | 1 | |
| 1980 | 1 | |
| 1959 | 1 | |
| 1958 | 1 | |
| 1935 | 1 | |
| 1924 | 1 | |
| 1920 | 1 |
| Distinct | 891 |
|---|---|
| Distinct (%) | 11.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 149.0965241 |
| Minimum | -200 |
|---|---|
| Maximum | 1479 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1604 |
| Negative (%) | 20.1% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | -200 |
| Q1 | 39 |
| median | 122 |
| Q3 | 259.75 |
| 95-th percentile | 646 |
| Maximum | 1479 |
| Range | 1679 |
| Interquartile range (IQR) | 220.75 |
Descriptive statistics
| Standard deviation | 260.6283769 |
|---|---|
| Coefficient of variation (CV) | 1.748051328 |
| Kurtosis | 1.742978378 |
| Mean | 149.0965241 |
| Median Absolute Deviation (MAD) | 104.5 |
| Skewness | 0.94132776 |
| Sum | 1192474 |
| Variance | 67927.15087 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 1604 | 20.1% |
| 89 | 39 | 0.5% |
| 65 | 37 | 0.5% |
| 41 | 36 | 0.5% |
| 57 | 32 | 0.4% |
| 51 | 32 | 0.4% |
| 61 | 31 | 0.4% |
| 180 | 31 | 0.4% |
| 46 | 31 | 0.4% |
| 93 | 31 | 0.4% |
| Other values (881) | 6094 |
| Value | Count | Frequency (%) |
| -200 | 1604 | |
| 2 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 3 | < 0.1% |
| 11 | 4 | 0.1% |
| 12 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 1479 | 1 | |
| 1389 | 2 | |
| 1369 | 1 | |
| 1358 | 1 | |
| 1345 | 1 | |
| 1310 | 1 | |
| 1301 | 1 | |
| 1290 | 1 | |
| 1253 | 1 | |
| 1247 | 1 |
| Distinct | 1204 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 810.8665916 |
| Minimum | -200 |
|---|---|
| Maximum | 2683 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 289 |
| Negative (%) | 3.6% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | 429 |
| Q1 | 652 |
| median | 807 |
| Q3 | 976 |
| 95-th percentile | 1301 |
| Maximum | 2683 |
| Range | 2883 |
| Interquartile range (IQR) | 324 |
Descriptive statistics
| Standard deviation | 321.3858069 |
|---|---|
| Coefficient of variation (CV) | 0.3963485612 |
| Kurtosis | 3.216668052 |
| Mean | 810.8665916 |
| Median Absolute Deviation (MAD) | 161 |
| Skewness | -0.3326971107 |
| Sum | 6485311 |
| Variance | 103288.8369 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 289 | 3.6% |
| 733 | 24 | 0.3% |
| 846 | 24 | 0.3% |
| 767 | 22 | 0.3% |
| 816 | 21 | 0.3% |
| 800 | 21 | 0.3% |
| 876 | 20 | 0.3% |
| 685 | 20 | 0.3% |
| 765 | 20 | 0.3% |
| 748 | 19 | 0.2% |
| Other values (1194) | 7518 |
| Value | Count | Frequency (%) |
| -200 | 289 | |
| 322 | 1 | < 0.1% |
| 325 | 2 | < 0.1% |
| 328 | 1 | < 0.1% |
| 330 | 1 | < 0.1% |
| 334 | 1 | < 0.1% |
| 335 | 1 | < 0.1% |
| 340 | 2 | < 0.1% |
| 341 | 1 | < 0.1% |
| 347 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2683 | 1 | |
| 2559 | 1 | |
| 2542 | 1 | |
| 2331 | 1 | |
| 2327 | 1 | |
| 2318 | 1 | |
| 2294 | 1 | |
| 2121 | 1 | |
| 2095 | 2 | |
| 2081 | 1 |
| Distinct | 268 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.15616404 |
| Minimum | -200 |
|---|---|
| Maximum | 333 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1607 |
| Negative (%) | 20.1% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | -200 |
| Q1 | 44 |
| median | 90 |
| Q3 | 125 |
| 95-th percentile | 180.15 |
| Maximum | 333 |
| Range | 533 |
| Interquartile range (IQR) | 81 |
Descriptive statistics
| Standard deviation | 129.3307646 |
|---|---|
| Coefficient of variation (CV) | 2.864077747 |
| Kurtosis | -0.1518844709 |
| Mean | 45.15616404 |
| Median Absolute Deviation (MAD) | 39 |
| Skewness | -1.124468508 |
| Sum | 361159 |
| Variance | 16726.44666 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 1607 | 20.1% |
| 97 | 72 | 0.9% |
| 95 | 69 | 0.9% |
| 101 | 67 | 0.8% |
| 96 | 66 | 0.8% |
| 114 | 65 | 0.8% |
| 121 | 65 | 0.8% |
| 107 | 64 | 0.8% |
| 119 | 64 | 0.8% |
| 110 | 64 | 0.8% |
| Other values (258) | 5795 |
| Value | Count | Frequency (%) |
| -200 | 1607 | |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
| 12 | 2 | < 0.1% |
| 13 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 333 | 1 | |
| 322 | 1 | |
| 310 | 1 | |
| 309 | 1 | |
| 306 | 1 | |
| 301 | 1 | |
| 288 | 1 | |
| 285 | 1 | |
| 283 | 2 | |
| 282 | 2 |
| Distinct | 1550 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1445.65929 |
| Minimum | -200 |
|---|---|
| Maximum | 2775 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 289 |
| Negative (%) | 3.6% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | 807 |
| Q1 | 1275.25 |
| median | 1494 |
| Q3 | 1698 |
| 95-th percentile | 2051.15 |
| Maximum | 2775 |
| Range | 2975 |
| Interquartile range (IQR) | 422.75 |
Descriptive statistics
| Standard deviation | 455.3477118 |
|---|---|
| Coefficient of variation (CV) | 0.3149758142 |
| Kurtosis | 4.15742293 |
| Mean | 1445.65929 |
| Median Absolute Deviation (MAD) | 211 |
| Skewness | -1.42815788 |
| Sum | 11562383 |
| Variance | 207341.5386 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 289 | 3.6% |
| 1580 | 22 | 0.3% |
| 1539 | 20 | 0.3% |
| 1638 | 19 | 0.2% |
| 1488 | 19 | 0.2% |
| 1467 | 19 | 0.2% |
| 1418 | 18 | 0.2% |
| 1570 | 17 | 0.2% |
| 1511 | 17 | 0.2% |
| 1604 | 16 | 0.2% |
| Other values (1540) | 7542 |
| Value | Count | Frequency (%) |
| -200 | 289 | |
| 657 | 1 | < 0.1% |
| 667 | 1 | < 0.1% |
| 668 | 1 | < 0.1% |
| 674 | 1 | < 0.1% |
| 682 | 1 | < 0.1% |
| 685 | 1 | < 0.1% |
| 697 | 1 | < 0.1% |
| 698 | 2 | < 0.1% |
| 702 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2775 | 1 | |
| 2746 | 1 | |
| 2691 | 1 | |
| 2684 | 1 | |
| 2679 | 1 | |
| 2667 | 1 | |
| 2665 | 1 | |
| 2662 | 1 | |
| 2643 | 2 | |
| 2641 | 2 |
| Distinct | 1682 |
|---|---|
| Distinct (%) | 21.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 979.9647412 |
| Minimum | -200 |
|---|---|
| Maximum | 2523 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 289 |
| Negative (%) | 3.6% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | 373 |
| Q1 | 710 |
| median | 942 |
| Q3 | 1253.75 |
| 95-th percentile | 1745.15 |
| Maximum | 2523 |
| Range | 2723 |
| Interquartile range (IQR) | 543.75 |
Descriptive statistics
| Standard deviation | 449.2001467 |
|---|---|
| Coefficient of variation (CV) | 0.4583839885 |
| Kurtosis | 0.7120722517 |
| Mean | 979.9647412 |
| Median Absolute Deviation (MAD) | 264.5 |
| Skewness | -0.009838203847 |
| Sum | 7837758 |
| Variance | 201780.7718 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 289 | 3.6% |
| 836 | 19 | 0.2% |
| 825 | 18 | 0.2% |
| 799 | 17 | 0.2% |
| 826 | 17 | 0.2% |
| 777 | 17 | 0.2% |
| 737 | 15 | 0.2% |
| 926 | 15 | 0.2% |
| 923 | 14 | 0.2% |
| 779 | 14 | 0.2% |
| Other values (1672) | 7563 |
| Value | Count | Frequency (%) |
| -200 | 289 | |
| 253 | 1 | < 0.1% |
| 261 | 1 | < 0.1% |
| 263 | 1 | < 0.1% |
| 266 | 1 | < 0.1% |
| 268 | 1 | < 0.1% |
| 274 | 3 | < 0.1% |
| 282 | 1 | < 0.1% |
| 283 | 1 | < 0.1% |
| 286 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2523 | 1 | |
| 2522 | 1 | |
| 2519 | 1 | |
| 2515 | 1 | |
| 2480 | 1 | |
| 2475 | 1 | |
| 2465 | 1 | |
| 2452 | 1 | |
| 2434 | 1 | |
| 2415 | 1 |
| Distinct | 420 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.54548637 |
| Minimum | -200 |
|---|---|
| Maximum | 44.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 289 |
| Negative (%) | 3.6% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | 3.4 |
| Q1 | 12.3 |
| median | 18.7 |
| Q3 | 25.1 |
| 95-th percentile | 35.1 |
| Maximum | 44.6 |
| Range | 244.6 |
| Interquartile range (IQR) | 12.8 |
Descriptive statistics
| Standard deviation | 41.83546511 |
|---|---|
| Coefficient of variation (CV) | 3.623534234 |
| Kurtosis | 20.65979406 |
| Mean | 11.54548637 |
| Median Absolute Deviation (MAD) | 6.4 |
| Skewness | -4.641463785 |
| Sum | 92340.8 |
| Variance | 1750.206141 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 289 | 3.6% |
| 20.8 | 55 | 0.7% |
| 21.3 | 50 | 0.6% |
| 20.2 | 47 | 0.6% |
| 19.8 | 46 | 0.6% |
| 23.7 | 43 | 0.5% |
| 21.7 | 42 | 0.5% |
| 13.8 | 42 | 0.5% |
| 15.6 | 42 | 0.5% |
| 14.6 | 41 | 0.5% |
| Other values (410) | 7301 |
| Value | Count | Frequency (%) |
| -200 | 289 | |
| 0.3 | 1 | < 0.1% |
| 0.6 | 1 | < 0.1% |
| 0.8 | 3 | < 0.1% |
| 1 | 3 | < 0.1% |
| 1.2 | 3 | < 0.1% |
| 1.3 | 4 | 0.1% |
| 1.4 | 4 | 0.1% |
| 1.5 | 2 | < 0.1% |
| 1.6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 44.6 | 1 | < 0.1% |
| 44.3 | 1 | < 0.1% |
| 43.4 | 1 | < 0.1% |
| 43.1 | 1 | < 0.1% |
| 42.8 | 3 | |
| 42.7 | 1 | < 0.1% |
| 42.6 | 1 | < 0.1% |
| 42.5 | 1 | < 0.1% |
| 42.2 | 2 | |
| 42 | 2 |
| Distinct | 749 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.97298075 |
| Minimum | -200 |
|---|---|
| Maximum | 88.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 289 |
| Negative (%) | 3.6% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | 15.5 |
| Q1 | 33.8 |
| median | 48.5 |
| Q3 | 61.6 |
| 95-th percentile | 77.1 |
| Maximum | 88.7 |
| Range | 288.7 |
| Interquartile range (IQR) | 27.8 |
Descriptive statistics
| Standard deviation | 49.46867681 |
|---|---|
| Coefficient of variation (CV) | 1.237552864 |
| Kurtosis | 17.08049849 |
| Mean | 39.97298075 |
| Median Absolute Deviation (MAD) | 13.8 |
| Skewness | -4.05847051 |
| Sum | 319703.9 |
| Variance | 2447.149985 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 289 | 3.6% |
| 53.1 | 29 | 0.4% |
| 57.9 | 26 | 0.3% |
| 50.8 | 25 | 0.3% |
| 61.1 | 25 | 0.3% |
| 60.8 | 25 | 0.3% |
| 57.6 | 24 | 0.3% |
| 50.9 | 24 | 0.3% |
| 50.1 | 24 | 0.3% |
| 42.8 | 23 | 0.3% |
| Other values (739) | 7484 |
| Value | Count | Frequency (%) |
| -200 | 289 | |
| 9.2 | 2 | < 0.1% |
| 9.3 | 1 | < 0.1% |
| 9.6 | 1 | < 0.1% |
| 9.8 | 1 | < 0.1% |
| 9.9 | 1 | < 0.1% |
| 10 | 2 | < 0.1% |
| 10.2 | 1 | < 0.1% |
| 10.7 | 1 | < 0.1% |
| 10.9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 88.7 | 1 | < 0.1% |
| 87.2 | 1 | < 0.1% |
| 87.1 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
| 86.6 | 1 | < 0.1% |
| 86.5 | 2 | |
| 86 | 1 | < 0.1% |
| 85.7 | 3 | |
| 85.6 | 1 | < 0.1% |
| 85.5 | 1 | < 0.1% |
| Distinct | 5918 |
|---|---|
| Distinct (%) | 74.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -6.183808665 |
| Minimum | -200 |
|---|---|
| Maximum | 2.231 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 289 |
| Negative (%) | 3.6% |
| Memory size | 62.6 KiB |
Quantile statistics
| Minimum | -200 |
|---|---|
| 5-th percentile | 0.350175 |
| Q1 | 0.7793 |
| median | 1.0316 |
| Q3 | 1.358525 |
| 95-th percentile | 1.741645 |
| Maximum | 2.231 |
| Range | 202.231 |
| Interquartile range (IQR) | 0.579225 |
Descriptive statistics
| Standard deviation | 37.53100978 |
|---|---|
| Coefficient of variation (CV) | -6.069238525 |
| Kurtosis | 22.72172521 |
| Mean | -6.183808665 |
| Median Absolute Deviation (MAD) | 0.2867 |
| Skewness | -4.971215237 |
| Sum | -49458.1017 |
| Variance | 1408.576695 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -200 | 289 | 3.6% |
| 0.8394 | 6 | 0.1% |
| 1.1199 | 6 | 0.1% |
| 0.8736 | 5 | 0.1% |
| 0.9271 | 5 | 0.1% |
| 0.8325 | 5 | 0.1% |
| 0.9684 | 5 | 0.1% |
| 1.0594 | 5 | 0.1% |
| 0.8944 | 4 | 0.1% |
| 1.0551 | 4 | 0.1% |
| Other values (5908) | 7664 |
| Value | Count | Frequency (%) |
| -200 | 289 | |
| 0.1988 | 1 | < 0.1% |
| 0.2029 | 1 | < 0.1% |
| 0.218 | 1 | < 0.1% |
| 0.2185 | 1 | < 0.1% |
| 0.2193 | 1 | < 0.1% |
| 0.2397 | 1 | < 0.1% |
| 0.242 | 1 | < 0.1% |
| 0.2462 | 1 | < 0.1% |
| 0.2477 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2.231 | 1 | |
| 2.1806 | 1 | |
| 2.1766 | 1 | |
| 2.1719 | 1 | |
| 2.1395 | 1 | |
| 2.1362 | 1 | |
| 2.1247 | 1 | |
| 2.1195 | 1 | |
| 2.117 | 1 | |
| 2.1164 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| date | co_gt | pt08_s1_co | nmhc_gt | c6h6_gt | pt08_s2_nmhc | nox_gt | pt08_s3_nox | no2_gt | pt08_s4_no2 | pt08_s5_o3 | t | rh | ah | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2004-03-10 18:00:00 | 2.6 | 1360 | 150 | 11.9 | 1046 | 166 | 1056 | 113 | 1692 | 1268 | 13.6 | 48.9 | 0.7578 |
| 1 | 2004-03-10 19:00:00 | 2.0 | 1292 | 112 | 9.4 | 955 | 103 | 1174 | 92 | 1559 | 972 | 13.3 | 47.7 | 0.7255 |
| 2 | 2004-03-10 20:00:00 | 2.2 | 1402 | 88 | 9.0 | 939 | 131 | 1140 | 114 | 1555 | 1074 | 11.9 | 54.0 | 0.7502 |
| 3 | 2004-03-10 21:00:00 | 2.2 | 1376 | 80 | 9.2 | 948 | 172 | 1092 | 122 | 1584 | 1203 | 11.0 | 60.0 | 0.7867 |
| 4 | 2004-03-10 22:00:00 | 1.6 | 1272 | 51 | 6.5 | 836 | 131 | 1205 | 116 | 1490 | 1110 | 11.2 | 59.6 | 0.7888 |
| 5 | 2004-03-10 23:00:00 | 1.2 | 1197 | 38 | 4.7 | 750 | 89 | 1337 | 96 | 1393 | 949 | 11.2 | 59.2 | 0.7848 |
| 6 | 2004-03-11 00:00:00 | 1.2 | 1185 | 31 | 3.6 | 690 | 62 | 1462 | 77 | 1333 | 733 | 11.3 | 56.8 | 0.7603 |
| 7 | 2004-03-11 01:00:00 | 1.0 | 1136 | 31 | 3.3 | 672 | 62 | 1453 | 76 | 1333 | 730 | 10.7 | 60.0 | 0.7702 |
| 8 | 2004-03-11 02:00:00 | 0.9 | 1094 | 24 | 2.3 | 609 | 45 | 1579 | 60 | 1276 | 620 | 10.7 | 59.7 | 0.7648 |
| 9 | 2004-03-11 03:00:00 | 0.6 | 1010 | 19 | 1.7 | 561 | -200 | 1705 | -200 | 1235 | 501 | 10.3 | 60.2 | 0.7517 |
Last rows
| date | co_gt | pt08_s1_co | nmhc_gt | c6h6_gt | pt08_s2_nmhc | nox_gt | pt08_s3_nox | no2_gt | pt08_s4_no2 | pt08_s5_o3 | t | rh | ah | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7988 | 2005-02-06 14:00:00 | 1.0 | 868 | -200 | 2.1 | 590 | 127 | 1081 | 100 | 753 | 420 | 10.6 | 26.0 | 0.3320 |
| 7989 | 2005-02-06 15:00:00 | 0.8 | 868 | -200 | 1.9 | 576 | 96 | 1128 | 78 | 755 | 363 | 10.3 | 27.7 | 0.3481 |
| 7990 | 2005-02-06 16:00:00 | 1.0 | 904 | -200 | 2.7 | 633 | 138 | 1040 | 100 | 789 | 410 | 10.2 | 28.3 | 0.3516 |
| 7991 | 2005-02-06 17:00:00 | 1.4 | 944 | -200 | 3.7 | 693 | 217 | 928 | 150 | 832 | 568 | 9.2 | 29.9 | 0.3479 |
| 7992 | 2005-02-06 18:00:00 | 1.1 | 925 | -200 | 2.9 | 649 | 186 | 1003 | 142 | 819 | 570 | 6.9 | 36.4 | 0.3635 |
| 7993 | 2005-02-06 19:00:00 | 1.6 | 985 | -200 | 4.5 | 736 | 227 | 891 | 165 | 875 | 774 | 6.0 | 38.0 | 0.3584 |
| 7994 | 2005-02-06 20:00:00 | 1.8 | 1002 | -200 | 5.3 | 780 | 252 | 855 | 179 | 892 | 857 | 5.8 | 36.4 | 0.3385 |
| 7995 | 2005-02-06 21:00:00 | 1.4 | 938 | -200 | 3.7 | 692 | 193 | 937 | 149 | 805 | 737 | 5.8 | 35.4 | 0.3286 |
| 7996 | 2005-02-06 22:00:00 | 1.1 | 896 | -200 | 2.6 | 627 | 158 | 1033 | 126 | 782 | 610 | 5.4 | 36.6 | 0.3304 |
| 7997 | 2005-02-06 23:00:00 | 1.0 | 907 | -200 | 2.4 | 614 | 150 | 1052 | 120 | 782 | 627 | 5.1 | 37.9 | 0.3358 |